Semi-Automatic Query Expansion using Most Discriminant Words
نویسنده
چکیده
Most casual users of IR systems type short queries. With current indexing technologies, short queries return enormous amount of results that may be impossible to examinate carefully. If users not find what they are looking for between the first 10/100 results, they may stop searching, losing the important results wrongly ranked by the search engine. While users generally know what they are looking for, the task of express their desires in a compact, precise, written form, i.e. the query, represent a real problem. Word usage is in fact both domain and user dependent, and may easily mislead the search engine. In this report I investigate a new method for query expansion, that exploiting the user’s feedback on some discriminant words, try to increase the focus on the user’s
منابع مشابه
Creation and Maintenance of Query Expansion Rules
In an information retrieval system, a thesaurus can be used for query expansion, i.e. adding words to queries in order to improve recall. We propose a semi-automatic and interactive approach for the creation and maintenance of domain-specific thesauri for query expansion. Domain-specific thesauri are especially required in highly technical domains where the use of general thesauri for query exp...
متن کاملQuery Architecture Expansion in Web Using Fuzzy Multi Domain Ontology
Due to the increasing web, there are many challenges to establish a general framework for data mining and retrieving structured data from the Web. Creating an ontology is a step towards solving this problem. The ontology raises the main entity and the concept of any data in data mining. In this paper, we tried to propose a method for applying the "meaning" of the search system, But the problem ...
متن کاملAnnotation and verification of sense pools in OntoNotes
The paper describes the OntoNotes, a multilingual (English, Chinese and Arabic) corpus with large-scale semantic annotations, including predicate-argument structure, word senses, ontology linking, and coreference. The underlying semantic model of OntoNotes involves word senses that are grouped into so-called sense pools, i.e., sets of near-synonymous senses of words. Such information is useful ...
متن کاملEnglish-Japanese Cross-lingual Query Expansion Using Random Indexing of Aligned Bilingual Text Data
Vector space models can be used for extracting semantically similar words from the co-occurrence statistics of words in large text data. In this paper, we report on our NTCIR 2002 experiments using the Random Indexing vector space method for extracting an English-Japanese cross-lingual thesaurus from aligned English-Japanese bilingual data. The crosslingual thesaurus has been used for automatic...
متن کاملAutomatic query expansion and word sense disambiguation with long and short queries using WordNet under vector model
This paper describes the experimentation conducted to test the effectiveness of automatic query expansion and word sense disambiguation (WSD) using short and long query of a topic TREC under vector model. We ran different experiments generating queries under vector model using linguistic information extracted from WordNet. Results show that query expansion with short queries and long queries is...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005